NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

RAG Pipeline for Domain Specific Applications: A Case Study in Disseminating Dementia Care Practices

Cummings, Aaron; Zhang, Xinyue; Olaniran, Mercy; Akintomide, Modupe (June 2025, ACM/IEEE International Conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE '25))

In closed-domain Question Answering (QA), Large Language Models (LLMs) often fail to deliver responses specialized enough for niche subdomains. Broadly trained models may not capture the nuanced terminology and contextual precision required in these fields, which frequently lack domain-specific conversational data and face computational constraints. To address this, we propose a methodology leveraging a Retrieval-Augmented Generation (RAG) framework that integrates data extraction with fine-tuning using domain-specific question-answer pairs. Our approach employs Question-Answer Generation (QAG) to create tailored training datasets, enabling fine-tuned models to incorporate specialized jargon and context while remaining computationally accessible to domain experts. To exemplify this methodology, we demonstrate its application within the medical domain through a case study centered on the creation of a dementia care chat assistant. A significant benefit of this approach lies in its ease of replication across various domains and scalability for integration into diverse user groups, making it a versatile solution for enhancing chat assistants.
more » « less
Free, publicly-accessible full text available June 24, 2026
FedKDShap: Enhancing Federated Learning via Shapley Values Driven Knowledge Distillation on Non-IID Data

https://doi.org/10.1145/3701716.3717645

Shadin, Nazmus Shakib; Zhang, Xinyue (May 2025, ACM)

Free, publicly-accessible full text available May 8, 2026
A Machine Learning Approach for Emergency Detection in Medical Scenarios Using Large Language Models

Akaybicen, Ferit; Cummings, Aaron; Iwuagwu, Lota; Zhang, Xinyue; Adewuyi, Modupe (March 2025, International Symposium on Intelligent Computing and Networking (ISICN 2025))

Free, publicly-accessible full text available March 17, 2026
Accessing a New Population of Supermassive Black Holes with Extensions to the Event Horizon Telescope

https://doi.org/10.3847/1538-4357/adbd45

Zhang, Xinyue Alice; Ricarte, Angelo; Pesce, Dominic W; Johnson, Michael D; Nagar, Neil; Narayan, Ramesh; Ramakrishnan, Venkatessh; Doeleman, Sheperd; Palumbo, Daniel_C M (May 2025, The Astrophysical Journal)

Abstract The Event Horizon Telescope (EHT) has produced resolved images of the supermassive black holes (SMBHs) Sgr A* and M87*, which present the largest shadows on the sky. In the next decade, technological improvements and extensions to the array will enable access to a greater number of sources, unlocking studies of a larger population of SMBHs through direct imaging. In this paper, we identify 12 of the most promising sources beyond Sgr A* and M87* based on their angular size and millimeter flux density. For each of these sources, we make theoretical predictions for their observable properties by ray tracing general relativistic magnetohydrodynamic models appropriately scaled to each target’s mass, distance, and flux density. We predict that these sources would have somewhat higher Eddington ratios than M87*, which may result in larger optical and Faraday depths than previous EHT targets. Despite this, we find that visibility amplitude size constraints can plausibly recover masses within a factor of 2, although the unknown jet contribution remains a significant uncertainty. We find that the linearly polarized structure evolves substantially with the Eddington ratio, with greater evolution at larger inclinations, complicating potential spin inferences for inclined sources. We discuss the importance of 345 GHz observations, milli-Jansky baseline sensitivity, and independent inclination constraints for future observations with upgrades to the EHT through ground updates with the next-generation EHT program and extensions to space through the black hole Explorer.
more » « less
Free, publicly-accessible full text available May 13, 2026
Towards More Robust and Scalable Deep Learning Systems for Medical Image Analysis

https://doi.org/10.1109/BigData62323.2024.10825626

Yenumala, Akshaj; Zhang, Xinyue; Lo, Dan (December 2024, IEEE)

Deep learning (DL) has attracted interest in healthcare for disease diagnosis systems in medical imaging analysis (MedIA) and is especially applicable in Big Data environments like federated learning (FL) and edge computing. However, there is little research into mitigating the vulnerabilities and robustness of such systems against adversarial attacks, which can force DL models to misclassify, leading to concerns about diagnosis accuracy. This paper aims to evaluate the robustness and scalability of DL models for MedIA applications against adversarial attacks while ensuring their applicability in FL settings with Big Data. We fine-tune three state-of-the-art transfer learning models, DenseNet121, MobileNet-V2, and ResNet50, on several MedIA datasets of varying sizes and show that they are effective at disease diagnosis. We then apply the Fast Gradient Sign Method (FGSM) to attack the models and utilize adversarial training (AT) and knowledge distillation to defend them. We provide a performance comparison of the original transfer learning models and the defended models on the clean and perturbed data. The experimental results show that the defensive techniques can improve the robustness of the models to the FGSM attack and be scaled for Big Data as well as utilized for edge computing environments.
more » « less
Free, publicly-accessible full text available December 15, 2025
Practical Considerations of Fully Homomorphic Encryption in Privacy-Preserving Machine Learning

https://doi.org/10.1109/bigdata62323.2024.10825068

Lo, Dan Chia-Tien; Shi, Yong; Shahriar, Hossain; Deng, Bobin; Zhang, Xinyue; Chen, Mei-Lan (December 2024, IEEE)

Machine learning has been successfully applied to big data analytics across various disciplines. However, as data is collected from diverse sectors, much of it is private and confidential. At the same time, one of the major challenges in machine learning is the slow training speed of large models, which often requires high-performance servers or cloud services. To protect data privacy while still allowing model training on such servers, privacy-preserving machine learning using Fully Homomorphic Encryption (FHE) has gained significant attention. However, its widespread adoption is hindered by performance degradation. This paper presents our experiments on training models over encrypted data using FHE. The results show that while FHE ensures privacy, it can significantly degrade performance, requiring complex tuning to optimize.
more » « less
Free, publicly-accessible full text available December 15, 2025
Hybrid Quantum Classical Machine Learning with Knowledge Distillation

https://doi.org/10.1109/ICC51166.2024.10622755

Li, Mingze; Fan, Lei; Cummings, Aaron; Zhang, Xinyue; Pan, Miao; Han, Zhu (June 2024, IEEE)

Full Text Available
Polymer-Unit Graph: Advancing Interpretability in Graph Neural Network Machine Learning for Organic Polymer Semiconductor Materials

https://doi.org/10.1021/acs.jctc.3c01385

Zhang, Xinyue; Sheng, Ye; Liu, Xiumin; Yang, Jiong; Goddard_III, William A; Ye, Caichao; Zhang, Wenqing (April 2024, Journal of Chemical Theory and Computation)

Full Text Available
Serverless-DFS: Serverless Federated Learning with Dynamic Forest Strategy

https://doi.org/10.1145/3638837.3638865

Deng, Bobin; Zhang, Xinyue; Lo, Dan Chia-Tien (December 2023, ACM)
EEFL: High-Speed Wireless Communications Inspired Energy Efficient Federated Learning over Mobile Devices

https://doi.org/10.1145/3581791.3596865

Chen, Rui; Wan, Qiyu; Zhang, Xinyue; Qin, Xiaoqi; Hou, Yanzhao; Wang, Di; Fu, Xin; Pan, Miao (June 2023, ACM International Conference on Mobile Systems, Applications, and Services)

Full Text Available

« Prev Next »

Search for: All records